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Abstract. Consider a random recusive tree with n vertices. We show 
that the number of vertices with even depth is asymptotically normal as 
n — > oo. The same is true for the number of vertices of depth divisible 
by m for m — 3, 4 or 5; in all four cases the variance grows linearly. On 
the other hand, for m > 7, the number is not asymptotically normal, 
and the variance grows faster than linear in n. The case m = 6 is 
intermediate: the number is asymptotically normal but the variance is 
of order n log n. 

This is a simple and striking example of a type of phase transition 
that has been observed by other authors in several cases. We prove, and 
perhaps explain, this non-intuitive behavious using a translation to a 
generalized Polya urn. 

Similar results hold for a random binary search tree; now the number 
of vertices of depth divisible by m is asymptotically normal for m < 8 
but not for m > 9, and the variance grows linearly in the first case both 
faster in the second. (There is no intermediate case.) 

In contrast, we show that for conditioned Galton- Watson trees, in- 
cluding random labelled trees and random binary trees, there is no such 
phase transition: the number is asymptotically normal for every m. 



1. Introduction 

Given a rooted tree T, let Xj(T) be the number of vertices of depth j 
(i.e., of distance j from the root). The sequence (Xj)jL Q is called the profile 
of the tree and has been studied for various types of random trees by many 
authors, see e.g. [H 13 El HH HH1 CSl ISO] 

We will here study the congruence class of the depth modulo some given 
integer m > 2. Thus, let Xj'(T) := J2k=j (modm)^(^) ^ e tne number 

(2) 

of vertices of depth congruent to j modulo m. For example, Xq (T) is the 
number of vertices of even depth. We further let X^ m ) (T) denote the vector 

(xtHn...,xw 1 (T)). 

The purpose of this paper is to study the asymptotic distribution as 
n — > oo of the random vector X^ m ^(T n ) for some random trees T n with n 
vertices. For a random recursive tree (RRT), see Section |21 for definitions, 
we will show (Theorem 12. lj) that X^" 1 ) is asymptotically normal for m < 6 
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but not for m > 7; furthermore, the variance grows linearly for m < 5 but 
faster for m > 6. For a random binary search tree (BST), the result is sim- 
ilar (Theorem 12.7(1 : X( m ) is asymptotically normal for m < 8, but not for 
m > 9; moreover, the variance grows linearly for m < 8 but faster for larger 
m. In contrast, for a conditioned Galton-Watson tree (CGWT), X^" 1 ) is 
asymptotically normal for every m (Theorem 12.12(1 . 

Note that the typical depths are of the order logn in a RRT or BST, 
but y/n in a CGWT. There is thus more room for smoothing between the 
congruence classes in the latter case, which may explain the asymptotic 
normality in that case, but we see no intuitive explanation for the difference 
between small and large m for the RRT and BST. 

We do not claim that the variables Xj 71 ^ have any importance in applica- 
tions, but they provide a simple and surprising example of a type of phase 
transition that has been observed in several similar combinatorial situations. 
One well-known such example is random m-ary search trees, see Chern and 
Hwang jllj : see also e.g. Mahmoud and Pittel [31j . Lew and Mahmoud 29 , 
Fill and Kapur , Chauvin and Pouyanne [H] . Other examples are random 
quadtrees, see Chern, Fuchs and Hwang jlUj . and random fragmentation 
trees, see Dean and Majumdar |12j and Janson and Neininger |26| . 

We will use a translation into generalized Polya urns; for such urns, there 
is a general theorem describes the phase transition in terms of the size of 
the real part of the second largest eigenvalue, see Athreya and Karlin 2 or 
Athreya and Ney [5J §V.9]; see also Kesten and Stigum j^Tj and, for more 
details, [13] • The same type of phase transition in a related (and overlap- 
ping) setting is described in [231 • We hope that our simple example can help 
to illustrate this surprising and non-intuitive phenomenon. It provides also 
an illustration of the results in in a simple concrete situation. 

We state the results in Section [21 Proofs are given in the following three 
sections, with one type of random trees in each. The proofs for RRT and 
BST in Sections [31 and El are very similar and uses generalized Polya urns, 
while the proof for CGWT in Section [5j uses generating functions and singu- 
larity analysis. Finally, in Section H3 we show that the oscillations shown in 
the results for RRT and BST except for small m are genuine and not only 
an artefact of the proof or of the normalization. 

Acknowledgement. This research was mainly done during the workshop 
Probability theory on trees and analysis of algorithms at the Mathematisches 
Forschungsinstitut Oberwolfach in August 2004. I thank the participants, 
in particular Hsien-Kuei Hwang, for inspiring comments. 

2. Definitions and results 

Let 1 denote the m-dimensional vector (1,...,1). (We do not always 
distinguish between row and column vectors in our notation.) 

We will occasionally deal with complex random variables. A complex 
Gaussian variable is a complex random variable £ such that Re £ and Im £ are 
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jointly Gaussian (i.e., normal). A complex Gaussian variable £ is symmetric 
if and only if E £ = E £ 2 = 0; its distribution is then determined by the scale 
factor E |C| 2 . See further jSH Section 1.4]. 

The discrete Fourier transform on the group Z m = {0, . . . , m — 1} is 
defined by 

m—l 

f(k) = £ J* f{j), (2.1) 
3=0 

where w = uj m := e 2 "" 1 /" 1 . 

All limits below are as n —* oo. 

2.1. Random recursive trees. A random recursive tree (RRT) with n 
vertices is a random rooted tree obtained by starting with a single root and 
then adding n — 1 vertices one by one, each time joining the new vertex to a 
randomly chosen old vertex; the random choices are uniform and indepen- 
dent of each other. If the vertices are labelled 1, 2, . . . , we thus obtain a tree 
where the labels increase along each branch as we travel from the root; the 
random recursive tree can also be defined as a (uniform) randomly chosen 
such labelled tree. (The distribution of a random recursive tree differs from 
the distribution of a uniform random labelled tree.) See also the survey |33| . 

We state the main results for RRT in the following theorem, and add 
various details in the remarks after it. 

Theorem 2.1. Let T n be a random recursive tree with n vertices. Then, 
the following holds for X( m ) = X( m )(T n ), asn^oo. 

(i) If 2 < m < 5, then 

n -l/2 ( X (m) _ at( , E m ) (2.2) 

for a covariance matrix S m given explicitly in Q2.5|) - Q2.8|) . 

(ii) If m = 6, then 



(nlnrO-WxW-ILl) ^iV(0,£ 6 ), 



n 
m 

where Eg is given explicitly in (|2.9|) . 

(iii) If m > 7, let a = cos(2-7r/m) > 1/2 and (3 = sin(2-7r/m). Then 

n - a f x (m) _ _ Re ( n iPw m ) ±5+ o, (2.3) 
V m J 

for some complex random vector W m = ^Z m e~ 27r ^ m ^_Q , where Z m is a 
complex random variable. In particular, along any subsequence such that 
(3\un mod 2ir — > 7 for some 7 € [0, 2ir], 

n -*(x(m) Re^^M^), (2.4) 

jointly in j = 0, . . . , m — 1. 
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Remark 2.2. The covariance matrices T, m in (i) and 



(ii) are given by 



£3 = 
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(2.5) 
(2.6) 

(2.7) 



(2.8) 



(2.9) 



Note that the matrices S m are circulant, i.e. invariant under a cyclic shift 
of both rows and columns. In particular, has the same asymptotic 

distribution for every j (when m < 6). 

Remark 2.3. The distribution of Z m = Z in (hi) is determined by the 
equalities in distribution 



Z 



l -W"Z, 



Z = f/ w (Z + wZ'), 



(2.10) 
(2.11) 



where u = a + pi = e 2ni / m , Z' = Z,W ~ Exp(l), [/ ~ f7(0, 1), and W, Z, 
U, Z, Z' are independent, together with EZ = 1. 
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The moments of Z and Z are all finite, and can be obtained recursively 
from (j2.11j) and (|2.10j) together with EZ = 1. In particular, 

EZ =1, EZ = — -, 

mT(l + uj) 

EZ 2 = EZ 2 — b 



2-u' m 2 (2-w)r(l + 2u;)' 

E|Z| 2 = ^-, Eizr 



2a -1' 11 m 2 (2a-l)r(l + 2a) : 



(3-w 2 )(2-cj)' ~ m 3 (3-w 2 )(2-w)r(l + 3w)' 

Remark 2.4. It follows from |23l Section 2] that all moments of the vari- 
ables in Theorem 12 . 1 1 st av bounded as n — > oo, and thus moment convergence 
holds in (i) and (ii) and for convergent subsequences in (iii). In particular, 

the variance of each Xj is of order n for m < 5, but larger for m > 6. 

Remark 2.5. The covariance matrix T, m has rank m — 1 when m < 5, 
while ^6 has rank 2 only. (Full rank is impossible because all row sums are 
0, reflecting the fact that the total number of vertices is non-random.) 

Remark 2.6. The results for the RRT T n become simpler if we state them 
in terms of the discrete Fourier transform x( m ) = (X( m \k))™ 1 defined by 
(|2.ip . (See also where the proof is based on this Fourier transform.) 

Clearly, X^(0) = T.jXf^ = \T n \ = n is deterministic. 

For 2 < m < 5, Theorem 12.11 implies (and is equivalent to) the joint 
convergence 

n -i/2jrM(jfe) _1> V*, k = l,...,m-l, (2.12) 

where Vk are complex (jointly) Gaussian variables such that, for k, I £ 
{1, . . . , m — 1}, using (j3,lj) below, 

(i) V m ~k = V k , 

(ii) EV k = 0, 

(iii) E(V k Vi) = when l^m-k, 

(iv) E |Vfe| - - _ 2ReuJ k i ~ l -2cos(27r/c/m)' 
It follows that further: 

(v) If k = m/2 (and thus u;^ = —1), then = Vk is a symmetric real 
Gaussian variable. 

(vi) If k ^ m/2, then EV 2 = and thus Vk is a symmetric complex 
Gaussian variable. 

(vii) The Gaussian variables Vk, 1 < k < m/2, are independent. 

Note that the joint distribution of V%, . . . , V m -\ is determined by (iv)-(vii) 
together with (i). 
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Similarly, for m = 6, 

(nlnnr^X^ik) V k , k=l,...,5, 

where now, however, only V\ and V5 = V\ are non-zero; V\ is a symmetric 
complex Gaussian variable with E |Vi| 2 = 1. 
Finally, for m > 7, we have 

n - m jH (1) - ™2 m , (2.13) 
while rT^X^ (Jfe) for A; = 2, . . . , m - 2. 

2.2. Binary search trees. A Binary search tree (BST) is constructed from 
a sequence a?i, . . . , x n of distinct real numbers as follows, see e.g. [2H1 Section 
6.2.2]. If n = 0, the tree is empty. Otherwise, start with a root. (In 
computer applications, x\ is stored in the root.) Then construct recursively 
two subtrees of the root by the same procedure applied to two subsequences 
of xi, . . . , x n : the left subtree from the Xi with Xi < x\ and the right subtree 
from the Xi with x% > x\. The number of vertices in the tree is thus n, and 
each vertex corresponds to an Xj. 

A random BST is obtained by this construction applied to a sequence 
xi, . . . , x n in random order. (Since only the order properties of xi, . . . , x n 
matter, we can let them be, for example, either a random permutation of 
1, . . . , n or n i.i.d. random variables with a common continuous distribution.) 

It is easily seen that a random BST can be grown by adding vertices one 
by one according to a Markov process similar to the definition of the RRT: 
Given a binary tree with n vertices, there are n + 1 possible positions for a 
new vertex, and we choose one of them at random (uniformly). 

Theorem 2.7. Let T n be a random binary search tree with n vertices. Then, 
the following holds for X*" 1 ) = X( m )(T ra ), as n — > 00. 

(i) If 2<m< 8, then 

^ 2 fx (m) --i) ^iV(0,S m ) 

V m / 

for a covariance matrix S m given explicitly in Remark \2.8i 

(ii) If m > 9, let a = 2 cos(27r/m) — 1 > 1/2 and f3 = 2 sin(27r/m). Then 

n ~ a ^ x (m) _ n.^ _ Re (n if3 W m ) ^ 0, (2.14) 

for some complex random vector W m = (•^ m e _27rJ1 ' /m )^_Q 1 , where Z m is a 
complex random variable. In particular, along any subsequence such that 
/31nn mod 2ir — > 7 for some 7 € [0, 2ir], 

n -a( X (rn) _ n\ Re ( e ^^/ m )Z m ), (2.15) 

V 3 m) 

jointly in j = 0, . . . ,m — 1. 
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Remark 2.8. The covariance matrices S m in (i) are circulant and explicitly 
given by the following first rows: 



m 
m 
m 
m 



m = 6 



m 
m 



7 



m(W), 

£(2,-1,-1), 

3jg (17, —3, —11, —3), 

2^(16,1,-9,-9,1), 

^(89,23,-37, -61,-37,23), 

637(62, 27, -15, -43, -43, -15, 27), 

^(269, 165, -11, -171, -235, -171, -11, 165). 



Remark 2.9. The distribution of Z, 
equalities in distribution 

d 



Z in (ii) is determined by the 



Z=%(2u>-l)W Ju - L Z, 
Z = loU 2uj - 1 (Z + Z'), 



(2.16) 
(2.17) 



where to = oj m = e 27ri / m , Z' = Z,W ~ Exp(l), U ~ 17(0, 1), and W, Z, U, 
Z, Z' are independent, together with EZ = 1. 

Again, the moments of Z and Z are all finite, and can be obtained recur- 
sively from (|2.17|) and (|2.16jl together with EZ = 1. In particular, 

2 



EZ = 1, 



EZ Z 



2J 1 



EIZI 



EZ 3 



4w - 1- 
2 

4a - 3' 



6to b 



1 -iv 2 



EZ 
EZ 2 
E\Z\ 2 
. EZ 3 



m{2uj - 1)T{2uj) ' 
4 

m 2 (2u; - l) 2 r(4w - 1 



m 2 (5-4a)r(4a- 1] 
8 



■E\Z\ 



m?{2Lo - l) 3 r(6w - 2 



■EZ 3 . 



(3w- l-w 3 )(4a; 

Remark 2.10. The proofs in |23| Section 2] apply to this situation too 
and show that all moments of the variables in Theorem 12.71 stay bounded 
as n — > oo, and thus moment convergence holds in (i) and for convergent 
subsequences in (ii). In particular, the variance of each is of order n 

for m < 8, but larger for m > 9. 

Remark 2.11. Just as for the RRT case in Remark 12 .61 the results become 
simpler if we state them in terms of the Fourier transform X^" 1 ). Clearly, 
again A > ( m )(0) = \T n \ = n. 

Further, for 2 < m < 8, (|2.12|) holds, where are complex Gaussian 
variables satisfying (i)-(iii) and (v)— (vii) in Remark 12 . 61 together with 

1 
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Finally, for m > 9, 

n -(2a, m -l)^(m) (1) (2 _ 18) 

while n -(^m-i)x(-m) (fc) ±5^ o for = 2, . . . , m - 2. 

2.3. Conditioned Galton Watson trees. A conditioned Galton-Watson 
tree (CGWT) with n vertices is a random tree obtained as the family tree 
of a Galton-Watson process conditioned on a given total population of n. 
(See e.g. for details.) The Galton-Watson process is defined using an 

offspring distribution; let £ denotes a random variable with this distribution. 
We assume, as usual, E£ = 1 (the Galton-Watson process is critical) and 
< a 2 = Var£ < oo. We assume further, for technical reasons, that 
E e a ^ < oo for some a > 0. (Equivalently, the probability generating function 
<p(z) := Ez^ is analytic in a disc with radius greater than 1.) 

It is well-known [J that the conditioned Galton-Watson trees are the 
same as the simply generated trees |32| . Many combinatorially interesting 
random trees are of this type, with different choices of £, for example labelled 
trees (£ ~ Po(l), a 2 = 1); ordered (=plane) trees (P(£ = k) = 2~ k - 1 , 
a 2 = 2); binary trees (£ ~ Bi(2, 1/2), a 2 = 1/2); complete binary trees 
(P(f = 0) = P(f = 2) = 1/2, a 2 = 1). 

It has been shown by Drmota and Gittenberger ^3] that the profile of 
a CGWT converges, after normalization, to the local time of a Brownian 
excursion. For the congruence classes we have a simpler result. As in many 
other results, see e.g. p]E]; different choices of £ only affects a scaling factor 
in the limit result. 

Theorem 2.12. Let T n be a random conditioned Galton-Watson tree with 
n vertices, with the offspring distribution given by a random variable £ such 
that E£ = 1, a 2 := Var£ > and Ee"^ < oo for some a > 0. Then, for any 
fixed m>2, the following holds for X( m ) = X( m )(T n ) as n — > oo, 

n-WxM- -l) ^iV(0,S m ) 

V TO / 

for a covariance matrix S m = (ct^-) gwen explicitly by 

12m z ' 
for <i <m — 1, < & < m and i + A; ta&en modulo to. 
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Remark 2.13. The covariance matrices S m are circulant. For small m, 
they are explicitly given by the following first rows: 



m = 2 

m = 3 

m = 4 

m = 5 

m = 6 



i?(2,-l,-l), 
64 (5, — 1, — 3, —1), 
g(2, 0,-1,-1,0), 
" 2 r (35, 5, -13,-19,-13,5). 



432 ' 

Remark 2.14. It follows from the proof in Section that moment conver- 
gence holds in Theorem 12. 121 In particular, the variance of each is of 
order n for every m. 

Remark 2.15. Note the curious fact that the asymptotic variance of xj m \ 
n<r 2 (l — m~ 2 )/12, is almost independent of m. 

Remark 2.16. For the Fourier transform X^" 1 ', we again have X( m )(0) = 
\T n \ = n. Further, for any m > 2, (|2. 12f) holds, where Vk are complex 
Gaussian variables satisfying (i)-(iii) and (v)-(vii) in Remark 12.61 together 
with 

I i 

(iy,,) E|Ffc|2 = \T^4^ = 2-2cos(2nk/ m y 

Remark 2.17. As a comparison to the results above, suppose that we 
construct X( m ) by randomly throwing n balls into m urns 0, . . . , m — 1, and 
counting the number of balls in each urn. Then xj m) ~ Bi(n,l/m), and 

the central limit theorem shows that n _1 / 2 (X( m ^ — — l) — —> N(0, E m ) with 
S = (m~ l 5ij — m~ 2 )ij. Moreover, the Fourier transform X^" 1 ) is a sum of 
n i.i.d. complex random variables and the central limit theorem shows that 
(|2.12j) holds, where T4 are complex Gaussian variables satisfying (i)-(iii) and 
(v)-(vii) in E,emark 12.61 together with 

(iv'") K|14| 2 = I- 

It follows from the formulas above that the asymptotic variance of X^ 
(for any j) is smaller for the depths in the random trees that we consider 
than for a random assignment of labels when m is small, but not when m is 
large. More precisely, it is smaller for RRT when m < 5 and for BST when 
m < 7; for CGWT, it is smaller when m < 12/a 2 - 1. (Thus, if a 2 > 4, 
then the asymptotic variance for CGWT is always larger than for a random 
labelling.) 

3. Random recursive trees 

The definition of RRT in Section [21 shows immediately that the distribu- 
tion of depths modulo m is given by the following generalized Polya urn: 
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The urn contains balls with labels 0, ...,m — 1, representing the depths 
modulo m of the vertices. Start with a single ball with label in the urn. 
Then, repeatedly, draw a ball (at random) from the urn, replace it, and if 
the drawn ball had label j, add a new ball with label j + 1 (mod m). 

This urn was studied briefly in [213 Example 7.9], see also [231 Example 
6.3]. We repeat the analysis in [22] with more details. Using the notation 
there we have, with the indices in {0, . . . , m — 1} and addition taken modulo 
m, 

&j = a j = 1j A = (£ji)i?i=o = (^,i+i)ij^cr 

The matrix A is circulant and corresponds to a convolution operator in 
£ 2 (Z m ); hence, A is diagonalized by the characters of the group 7L m . More 
precisely, let uj = ui m := e 27n /"\ Then A has left eigenvectors Uj = {oj jk ) 1 k n ~Q 
and right eigenvectors Vj = (jn~ 1 u)~^y^~Q with eigenvalues urf , for j = 
0, . . . , m— 1. Thus, A has m simple eigenvalues, and the dominant eigenvalue 
Ai is u>° = 1. (Note that the corresponding eigenvectors uq = 1 and vo = 
m~ l l are denoted u\ and v\ in |25|.) We have chosen the normalizations 
such that (uj)j and (vj)j are dual bases; moreover uq and vo satisfy the 
normalizations in [23 (2.2)-(2.3)]. 

The eigenvalues with second largest real part are A2 = uj and A3 = 57. 
Since ReA2 = cos(27r/m) < 1/2 when m < 5, ReA2 = cos(27r/m) = 1/2 
when m = 6, and ReA2 = cos(27r/m) > 1/2 when m > 7, the trichotomy in 
Theorem 12. II follows from [251 Theorems 3.22-3.24]. (The conditions (Al)- 
(A6) there are easily verified, see [23 PP- 180-181].) 

More precisely, in Case (i), the convergence to a normal distribution fol- 
lows by |251 Theorems 3.22]. To find the covariance matrices Xl m , we use 
|25| Lemma 5.3(iii)] (or [251 Lemma 5.3(i) or (ii), Lemma 5.4 and Lemma 
5.5]) . We have D = m~ x I and thus u'jDiik = m~ 1 Uj • Uk = unless j = —k 
(mod m), while u' m _ k Duk = u* k Duk = 1. (We use the notation in [23] that 
u ■ v := u l v, without complex conjugation.) Hence, 

m— 1 ^ 

s m = y~] r v i v h ( 3 - 1 ) 

fr[ l-2Reu;^ 3 3 

and a straightforward evaluation yields the matrices (|2.5|) - (|2.8|) . (Note that 
S m is rational also for m = 5, although Rec^s = (y/b — l)/4 is not. This is 
easily explained by Galois theory.) 

The case m = 2, with A = q), is a special case of the so-called Fried- 
man's urn [20] (studied already by Bernstein [1]), and the result follows 
alternatively directly from Bernstein [1], [S] or Freedman |19j : see also |251 
Example 3.27]. 

In Case (ii), m = 6, we similarly find by |251 Lemma 5.3(iv)] 
S 6 = Vl vl + v b v* b = 6" 2 (2 Re u) 3 f k ) jk , 

which yields (|2~9|) . 
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In Case (iii), (|2.3j) follows from |25| Theorem 3.24], with d = and 
A2 = uj m = a+i/3. Let us now denote the eigenvectors Uj and Vj belonging to 
u J by u^j and v^j. Then f25| Theorem 3.24] further shows that W m belongs 
to the linear span E w of v w = (m -1 a;~ 3 ')j; hence W m = (i? TO u;~ J )™ =0 as 

asserted for some complex random Z m := mT^u^ ■ W m . By |25l Theorem 
3.26], (t2~Ttl holds with Z = u u ■ W u , and W u as in [251 Theorem 3.1]. 

Recall that we start the urn with a single ball with label 0. Let, as 
in |251 Theorem 3.9], W w j be the limit random variable corresponding to 
W w if we instead start with a single ball of type i. By symmetry, W Wj i is 
obtained from W Ut o = W w by a cyclic shift of the components, and thus 
Zi := Uu ■ W Uji = ^Z. By $Z5\ Theorem 3.9(h)], 

Wvj^irQVuj+uW^i+i) (3.2) 
with W^ i+1 distributed as W Uj i+i and independent of W^. Consequently, 

Zi = U^iZi + Z' i+1 ) 4 u*>( Zi + cjZi), 

with Zi, Z' i+1 , Z- independent, and taking i = we obtain Q2.11JI . Moreover, 
EZ = Uu)0 = 1 by 2 5, Theorem 3.10]. 

Conversely, (|2.11|) implies (|3.2|) with W Ut i = ^Zv^, and thus [2^1 Theo- 
rem 3.9(iii)] implies that the distribution of Z is determined by p. llj) and 
EZ. The distribution of Z then is determined by ()2.1(Jj) . 

For higher moments of Z, we take moments in (j2.11|) . For example, 

E Z 2 = E U 2uj E(Z + ujZ') 2 = (1 + 2uj)- 1 ((1 + uj 2 ) E Z 2 + 2cj(E Z) 2 ) , 

which, using E Z = 1, gives EZ 2 = 2/(2- uj) after rearrangement. We leave 
the corresponding calculations for E|Z| 2 and EZ 3 to the reader. See also 
[23 Theorem 3.10]. 

The formulas for moments of Z then follows from (|2.10j) , using E W z = 
T(l + z) when Kez > —1; see also (23 Theorem 3.26] and [221 Section 2]. 

4. Binary search trees 

To describe the profile of the random BST in terms of an urn model, we 
make a simple transformation. A BST with n vertices has n + 1 possible 
positions for a new vertex. We augment the tree by adding n + 1 new 
vertices at these positions; the n + 1 new vertices are called external and the 
n original vertices are called internal. Thus every internal vertex has two 
children, and every external vertex has none. 

Thus, {Xj)JL Q is now the profile of the internal vertices. We similarly 
define (Yj)°5L as the profile of the external vertices, and note that, since 
every internal vertex has exactly two children, 



2X j „i=X j +Y j , j>l. 
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For j = we instead have Xq + Yq = Xq = 1. (Recall that the root has no 
parent.) Passing to congruence classes modulo m we thus have, for every j, 

2X M_ x M =y M_ 5 . o _ (41) 

The growth of the augmented tree can be described as follows: Choose 
an external vertex at random, convert it to an internal vertex and add two 
new external vertices as its children. The distribution of depths for external 
vertices modulo m is thus given by a generalized Polya urn similar to the 
one in Section 12 with the difference that when we draw a ball with label j, 
we remove it and add two balls with label j + (We start with 2 balls with 
label 1; alternatively, we start with a single ball with label and make one 
more draw.) 

The matrix A is now (26ij+\ — <5i,j)7j=o> a g am with index addition mod- 
ulo m. The eigenvectors are the same Uj and Vj as in Section |31 but the 
corresponding eigenvalue is now 2oj 3 — 1. In particular, the largest eigenvalue 
(i.e., the one with largest real part) is Ai = 1 (as before), and the second 
largest are A2 = 2uj — 1 and A2, with real part ReA2 = 2 cos(27r/m) — 1. 

Hence, the condition Re A2 < 1/2 becomes cos(2ir/m) < 3/4, which holds 
for m < 8, while for m > 9 we have cos(27r/m) > 3/4 and thus ReA2 > 1/2. 

We now obtain, exactly as in Section |3J normal convergence of Y( m ) = 
(Y^ m \ . . . ,Y^ l ) when m < 8. More precisely, by (23 Theorem 3.22 and 
Lemma 5.3(iii) or Lemmas 5.3(h) and 5.4], (|2.2|) holds for Y( m ) with 

S - = L 3-4W ^' V * = ^ 3-4W ^ V * ■ (42) 

3=1 3=1 

Similarly, by [23 Theorem 3.24], when m > 9, and ((231) nold for Y^, 
for some Z m and W m = (Z m uj-i) m =0 . Further, by |2S1 Theorems 3.26, 3.9 
and 3.10], 

Z = fW^Zm, (4.3) 
where Z satisfies ()2.17(l and MZ = 1. 

To obtain the results for X^ m \ we use (|4.1|1 . It is convenient to solve this 
convolution equation by taking the Fourier transform, which yields, for all 
k, 

{2io k - l)X^ (k) = Y ( m ) (k)-l. (4.4) 

In the case m < 8, we have in analogy with Eemark 12.61 Y^ m \0) = n + 1 
and the joint convergence 

n- 1/2 Y (m) (k) ^ V k , fc = l,...,m-l, (4.5) 

where Vk are complex Gaussian variables satisfying (i)-(iii) and (v)-(vii) in 
Remark 12.61 together with 

(] E\V \ 2 = ^ 2UJ ™ - l][2 = 5 ~ 4RecJ ^ = 5-4cos(27rfc/m) 
llV ' 1 kl 3-4Rea;^ 3-4Rea;^ 3 - 4cos(2vr/c/m) ' 
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It follows immediately from (|4.4|) and (|4.5|) that 



n 



fc = 1, . . . , m — 1, 



(4.6) 



which yields the statement in Remark 12.111 (for m < 8), with the meaning 
of Vk changed. 

Similarly, for m > 9, we have, with A := 2uj — 1, in analogy with (|2. ^li - 



ra 



^y( m )(l) 



yZ m , and thus 



n 



-*XH(1) = n-^Y^Hl) + o(l) ^ A-^Z m , 



while n A x( m ) (A;) for k = 2, . . . , m— 2. We change the meaning of Z m 
(replacing Z m by XZ m ) and write this as (|2.18|) . simultaneously changing 
(TO) to (t2~TT)l) . 

Theorem 12.71 now follows by taking the inverse Fourier transform. When 
m < 8, we obtain (|2.2|) with 



m— 1 

E 



3-4Reu4 J J ' 



(4.7) 



which gives the explicit values in Remark 12.81 fwith some help of Maple). 

Remark 4.1. The covariance matrices in ()4.2[) for the case of external 
vertices are circulant and explicitly given by the following first rows: 



m 
m 
m 
m 



m = 6 



m 
m 



7 



28^' 

£(14,-7,-7), 

3^(97,-27,-43,-27), 

2^(76,-9,-29,-29,-9), 

^(353, 11, -109, -157, -109, 11), 

^(202, 41, -43, -99, -99, -43, 41), 

^(685, 309, -43, -363, -491, -363, -43, 309). 



5. Conditioned Galton-Watson trees 

For Galton-Watson trees, we use generating functions and singularity 
analysis. See [Hj for similar arguments. Given a tree T, we define its profile 
polynomial by 

S(x) = S(x;T) :=^x d ^ = J>^(T). (5.1) 

vgt j 

We will first find the asymptotic distribution of S(x;T n ) for x on the unit 
circle (excluding the trivial case x = 1). Note that S(x) for |x| = 1 is the 
Fourier transform of the sequence (Xj) as a function on Z. 
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Letting T be a random Galton-Watson tree, we define the generating 
functions, for k > 0, 

k 

F k (t; Xl ,...,x k ):= E(tl r l ]J S( Xi ; T)) . (5.2) 

i=l 

Here i and complex numbers. (It is also possible to regard F k 

as a formal power series, but we will need analytic functions.) We regard 
xi, . . . , x k as fixed and consider F k as a function of t. We consider only Xi 
with \xi\ < 1; then ^(xjjT)) < |T| and the expectation in (|5.2|) exists at 
least for |t| < 1. Thus (|5.2|) defines as an analytic function of t in the unit 
disc \t\ < 1. We will soon see that it can be continued to a larger domain. 

Let Dq be the degree of the root. If we condition on Dq = q > 0, then 
the random tree T consists of the root plus q branches T\ , . . . , T q that are 
independent and have the same distribution as T. Further, \T\ = 1 + X^i l^/'l 
and S(x;T) = 1 + ]Cj=i xS{x\ Tj), and thus, summing over all sequences 
Jo, ...jig of disjoint (possibly empty) subsets of {l,...,k} with (Ji -(? = 
{l,...,k}, 

k q 

t[S( Xi ;T) ^ H H r,.^ r,: 7 ( S. 
i=l i ,...,i q j=neij 

Consequently, 

k q 

E(f\jis(xiiT) | D =q) =e 2 *n(* |Ti| n^ 5 ^;^)) 

i=l f ,... ,J, i=i ieij 

The terms in the latter sum do not depend on the order of 1%, . . . ,I q . Thus, 
if J2*i i denotes the sum over such sequences with I\, . . . ,Ii non- 

empty and in, say, lexicographic order, then, with q- := q{q— 1) ■ ■ ■ (q—l + i), 



E(t^HS( Xi ;T) D = 

i=i 

Z=0 Io,...,h i0 o j=l 

Now take the expectation, i.e. multiply by F(Dq = q) and sum over q. We 
have, for \z\ < 1 at least, since Dq = £ and thus Kz D ° = <p(z), 

£y**-'p(A, = q) = e(d 1 ^z d o- 1 ) = </)(z), 

q>l 
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and thus (|5.H|) yields 

1=0 I ,...,Ii i£I j=l 

In particular, k = yields the well-known formula Fo(t) = t(p(Fo(t)). The 
next two cases are 

F x (t;s) = ttp(F (t)) + V(^o(t))^l(t;^), 
F 2 (t; x, y) = ttp(F {t)) + tip' {F (t))(xF 1 (t; x) + y*i(i; y) + xyF 2 (t; x, y)) 
+ t<p"(F Q (t))xyF 1 (t;x)F 1 (t;y). 

We thus have, recalling tip(Fo(t)) = Fo(t), 

Fi(t;x) = - fif (5.4) 

1 - x^Uw)) 

F 2 {t;x,y) = (1 - xyt^(F (t))) _1 (Fo(t) + t<p'(F (t))(xF l (t;x) + yF 1 (t;y)) 

+ t V "(F (t))xyF 1 (t;x)F 1 (t;y)' 

(5.5) 

and in general 

A' 



F k {t;x 1 ,...,x k ) = (l-t(j[x^<f/(F (t)fj 



i=l 

xt EE ^ ) (i ? o(t))(n^)n^i^{ x - iG/ ^)' ( 5 - e ) 

i=0 / ,...,/i igib i=l 

where ^** means Yl* with the single term with I = 1 and = k omit- 
ted. This gives recursively an explicit formula for each F k (t;xi, . . . ,x k ) as 
a rational function of x±, . . . , x k and t<p®(F (t)), 0<l<k. 

We say, see JSJ Chapter VI], that a A-domain is a domain of the type 
{z : \z\ < 1 + s, | arg(z — 1)| > 7r/2 — 5} for some (small) positive e and <5, 
and that a function is A- analytic if it is analytic in some A-domain, or can 
be extended to such a function. 

Suppose for simplicity in the sequel that £ is aperiodic. (The periodic 
case is similar with standard modifications as in [181 Chapter VI. 7]; we omit 
the details.) Then, by a standard result in singularity analysis, see e.g. |181 
Proposition VI. 1], Fo(t) is A-analytic, with 



F (t) = 1 - ^j2/a 2 {\ - t) 1 ' 2 + 0(1 - t) ast->l. (5.7) 

(Here and below, we consider only t in a suitable A-domain.) It follows 
that in a (possibly smaller) A-domain, |-Fb(i)| < 1 and hence \cp'(Fo(t))\ < 
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(p'(l) = 1. Further, 

tp'(F (t)) = 1 + <p"(l)[F (t) - 1) + O(F (t) - l) 2 

= l-V2a(l-t) 1 / 2 + 0(l-t), (5.8) 

and it follows easily that \tip' (Fo(t))\ < 1 in a A-domain. Hence, for every 
fixed x with \x\ < 1, (l — xtip' (Fo(t))) is A-analytic, with 

( 1 -^)))-^{-r (i " frl/2+o(i) ' <-> 

Lemma 5.1. For A; > 1 and any complex xi,...,Xk with \xi\ < 1 and 
Xi ^ 1, F k (t;xi, . . . , x k ) is A-analytic (as a function oft), with 

F k (t; Xl ,...,x k ) = a k (x u Xk ){l- t )^ k ~ 1 )/ 2 + 0((l-i)-(*- 2 )/ 2 ) , (5.10) 

where a\(x) = 1/(1 — x), and for k > 2, a k (xi, . . . , x^) =0 if x\ ■ ■ ■ x k ^ 1, 
while if x\ ■ ■ ■ x k = 1, 

a k (xi, . . . ,x k ) = 2~ 3/2 a ^ a\j\{xi : i G I)a k _\j\(xi : i ^ I). (5.11) 



0C/C{l,...,fc} 



x 



-2 



In particular, when \x\ = 1, a2(x,x) = 2 1//2 <r|l 

Proof. The A-analyticity follows by ()5.6j) and induction, using the results 
just shown. 

For k = 1, the expansion (|5.10|) follows from ()5.4|) . H5.7|) and ()5.8|) . 

Similarly, (|5TTU|) for A; = 2 follows from (fB3|) together with (f5"7|) . (fS~%)) 
and (|5.9|) : this also yields 02(2;, y) = if xy 7^ 1 and for xy = 1, using also 
¥>"(!) = 

a 2 (x,y) = 2~ 1/2 o-~ 1 (F (l) + xai(x) + yai(y) + a 2 xya 1 (x)ai(y)) , 

which equals 2 _1//2 ct|1 — x| -2 because now |x| = \y\ = 1 and y = x and thus 

x y / x \ 

xai(x) + yai(y) = - h = 2 Re = -1. 

1 — x 1 — y vi — x/ 

For /c > 3 we argue similarly. By induction, all terms in the sum in (|5.6f) 
are 0((1 — t)~( fc-2 )/ 2 ) . The result when x\ - ■ -x k 7^ 1 follows immediately 
by Q. 

Assume now xi • • • X& = 1. If = fc— 1, then Ji = {1, . . . , k]\x p for some 
p, and thus Ilie/i Xi = x p 1 ^ nence by induction F k _i(t; {xj : i G ii}) = 
0((1 - i)-( fe ~ 3 )/ 2 ). The leading terms in (J^SJ) are thus those with / = 2 
and Iq = ft, I2 = {1, . . . ,k}\I%, which proves the claim, including ()5.11|) . by 
another application of 1)5. 9 jl . (Note the factor 1/2 because we assume that 
I\ and I2 are in order, but not necessarily / and its complement.) □ 

We next solve the recursion (|5,11|) . Let N(x±, . . . ,x k ) be the number of 
pairings of x\, . . . , x k into k/2 pairs of the type {x, x}. (Thus N(x±, . . . , x k ) = 
if A; is odd.) 
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Lemma 5.2. Let k > 2. Suppose that xi, . . . , x k G {x G C : \x\ = 1 but x ^ 
1}. Then 

r((fc-i)/2) * a 

a fc (xi, . . . , = -= -iV(xi, ...,#*.) I I — 

\/2na xx 1 



i=l 

Proof. For k = 2, the result follows directly from Lemma 15.11 

For > 3, we use induction. The result is trivial if x\ ■ ■ ■ x k ^ 1. Hence, 
we assume x\ ■ ■ ■ x k = 1 and use Q5.11JI . First, note that it suffices to consider 
I with |/| even. In fact, if |7| is odd and |/| > 3, then Nn\(xi : i £ I) = 0, 
and thus 01/1(0:* : i G I) = by induction. If |J| = 1, then I = {x p } for some 
p. Since x * = x p 1 / 1) we have N k _\j\{xi : i £ I) = 0, and thus by 

induction a k in(xi '■ i £ I) = 0. Similarly, we may assume that k — \I\ is 
even. 

The result thus holds when k is odd (with a k [xi, . . . , x k ) = 0). 

Now let k be even, k = 21 with / > 2. We use induction on the right hand 
side of (|5.11|> . Say that a pairing of x\, . . . ,x k is good if each pair consist of 
two conjugate numbers. Note that Niji(xi : i £ L)N k _\j\{xi : i £ I) equals 
the number of pairs (a, (5) where a is a good pairing of (x{ : i €. I) and 
(3 is a good pairing of (xi : i £ I). Each such pair (ct,/3) defines a good 
pairing of xi,...,x k , and conversely, each good pairing of xi, . . . , Xk splits 
into good pairings a and (3 in 2 l — 2 ways: (.) ways with |/| = 2j for each 
j = 1, . . . , I — 1. Consequently, (|5.11|) yields, 

l-l 

a fc (xi,... ,x k ) = 2~- i/2 o-y ( ". )N(xi,...,x k ) 



r(j-i/2)r(f-j-i/2) -A- a 

2tt(t 2 J-J-l-Xi' 
1=1 * 

To complete the induction step, it is now sufficient to verify 

£ - l/2)T(l-j - 1/2) = 40Fr(/ - 1/2), 

j=l V/ 

for I >2. This is an immediate consequence of the binomial convolution 

„o W1 , won „^r(i-i/2)r(i-i-i/2) 



= /!E(- 1 )' r (-V2) 2 ( 1 f)(^ 2 j ) =«(-l)T(-l/2) 2 Q =0, 

when / > 2, since the terms with j = and j = / both are T(—l/2)T(l — 
1/2) = -2v^r(/ - 1/2). □ 
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Lemma 5.3. Let k > 1. Suppose that xi, . . . , x k G {x G C : \x\ = 1 but x ^ 
1}. TTien 

k 

E(S(xv, T n ) ■ ■ ■ S(x k ;T n )) = N(x\, ...,x k )(j[ J^)^ 2 + 0(n (fc ~ 1)/2 ) . 
Proof. By 

[t n ]F k (t; x 1 ,...,x k ) [t n ]F k (t; x 1 ,...,x k ) 



E(S(xi;T n ) ■ ■ ■ S(x k ;T n )) 

Lancia 
k > 2, 



P(|T| = n) [t n ]F (t) 
Standard singularity analysis see e.g. ^3 Chapter VI], and (|5.1U|) yield, for 



[e\F k (t-X X , ...,X k ) = ^__^) n ( fc -3)/2 + 0(n ( fe -4)/ 2)j 

r((fe-i)/2) 

while for k = 1 we get 

[t n ]Fi(t;x) =0(n- 3 / 2 ). 
Similarly, ()5.7() yields, as is well-known, 

[t n ]F (t) = -^n- 3 / 2 + 0(n~ 5 / 2 ). 

V 27T(7 



The result follows from these formulas and Lemma 15.21 □ 

We can now identify the asymptotic moments and thus the asymptotic 
distribution of S(x;T n ). 

Theorem 5.4. LetU(x) be a family of complex Gaussian random variables, 
defined for \x\ = 1 but x ^ 1, such that 

(i) U (x) is symmetric complex Gaussian when Im x ^ 0, 
to#iE|[/(:e)| 2 = 1/|1-x| 2 ; 

(ii) f7 (a?) is symmetric real Gaussian when Imx = (i.e., when x = — 1), 
with E\ U(x)\ 2 = 1/| 1 -x\ 2 ; 

(iii) 17(5) = U(x); 

(iv) the variables U(x), Imx > 0, are independent. 

Then, for the CGWT, S(x;T n ) / \fn — —* aU{x), jointly for all such x. 

Proof. Note that the assumptions imply that EU{x)U{y) = 1/|1 — x\ 2 if 
xy = 1 (and thus y = x), but E U(x)U(y) = otherwise. By Lemma 1531 
and the formula |24| Theorem 1.28] for joint moments of Gaussian variables 
(known as Wick's theorem), 

n~ k ' 2 E(5(xi; T n ) ■ ■ ■ S(x k ; T n )) a k E(U( Xl ) • • • U(x k )) . 

Replacing one or several xt by their conjugates, we see that the same holds 
if we replace some S and U by their conjugates. Hence the result holds by 
the method of moments (applied to the real and imaginary parts). □ 
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Proof of Theorem \2.1iA As remarked above, for any tree T, the numbers 
S(x;T) for \x\ = 1 form the Fourier transform of the sequence (Xj(T)). 
It follows that the discrete Fourier transform X*-" 1 ) of X( m -*(T) equals the 
vector (S(u k ;T))%-l, where lo = uj m , cf. 1)2- 1|) and 1)5. 1JI . Hence, by Fourier 
inversion, 

1 m—1 

*;(t) = - j>-^ fc ; T). 

k=0 

Theorem 15.41 thus implies, since trivially S(l;T n ) = n, 

m—1 

n-^(x 3 (T n ) Z 3 := * V c^Z/frA 

\ m. / m ^ — ' 



m/ m 

k=l 



jointly for all j. Since the variables U(u k ) are jointly (complex) Gaussian, 
the variables Zj are too; moreover, each Zj is real. Clearly, EZj = 0, and 
the covariance matrix is given by 



o m—1 



E(ZiZj) = E^Z,) = %Y u^ k E\U(u k )\ 2 

k=l 

2 m—1 

^l^^) fc |l-o; fc |- 2 . (5.12) 



k=l 



To evaluate this sum, define a function / on 7L m = {0, . . . , m — 1} by 
f(j) = J ~ ( m ~ Then its Fourier transform is 

m—1 1 

i=o 

while /(0) = 0. 

Let further g ■= f * f on Z m , i.e. = Eigz m /0' + *)/(*)• Then 

g(fc) = | /(&) | 2 = m 2 |w fc — 1| -2 , k 0, and thus, by Fourier inversion again, 

m—1 m—1 

u {j ~ l)k \l - uj k \- 2 = mT 2 oj^ k g{k) = m"Vi - j). 
fe=l fc=0 

Hence, by (|5.12() . E(ZjZj) = a 2 m~ 3 g(i — j). It remains only to evaluate 5. 
For < j < to, 

m-l-j m-l 

m — 1 \ 2 



i=0 i=m—j 



m 2 — 1 j(m — j) 

m m . 

12 2 n 
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6. Oscillations 

In Case (iii) of Theorem 12.11 (m > 7), we do not have convergence 
in distribution: the sequence of random vectors n~ a (X( m ) — — l), n = 
1, 2, . . . , is tight and thus suitable subsequences converge as is shown explic- 
itly in (|2.4j) . but different subsequences may have different limits, and thus 
n — a (^Xq™ 1 ^ — ^-l) , for example, does not have a limiting distribution. Indeed, 
suppose that n - a (X^ m) - £l) V, say. Then, by (f£ij> . Re(e i7 Z m ) = V 
for every 7 G [0,2vr]. In particular, Re(e^EZ m ) = ERe(e^Z m ) = EV is 
independent of 7 G [0, 2tt], which is a contradicion because EZ m 7^ by 
Remark 12.31 

Nevertheless, it is conceivable (although implausible) that Xk ' has a 
limiting distribution if we choose the norming constants carefully, i.e. that 
a n (X^ —b n ) V for some non-degenerate V and real constants a n > 

and 6 n . It then would follow from (|2.4j) that Re(e 17 Z m ) = a 7 y + & 7 for every 
7 G [0, 27r] and some real constants a 7 > and 6 7 , see e.g. [2H Section 9.2]. 
In other words, Re(e 17 Z m ) would have a distribution of the same type for 
every 7, except when it is degenerate. 

To rule this out, we show a general result. 

Proposition 6.1. Let Z be a complex random variable such that E|Z| 3 < 
00. Suppose that there exists a random variable V and, for every 7 G [0, 2tt], 

some real constants a 7 > and 6 7 such that Re(e 17 Z) = a^V + 6 7 . Then 
either 

(i) Z = aW + b for some real random variable W and some complex 
constants a and b, and thus \ E(Z — E Z) 2 \ = E \ Z — E Z\ 2 ; or 

(ii) E(Z - E Zf = E(Z - E Z) 2 (Z-EZ) = 0. 

Proof. If all a 7 = 0, then Re(e 17 (Z — EZ)) = a.s. for every 7, and it 
follows by the Cramer- Wold device that Z = E Z special case of (i). 

Thus assume that some a 7 > 0. Then E|F| 3 < 00. By replacing Z by 
Z — EZ and V by 7-EF (changing 6 7 accordingly) , we may assume that 
EZ = EV = 0, and thus 6 7 = 0. If V = a.s., then, by the Cramer-Wold 
device again, Z = a.s., and (i) holds. Assume thus EV 2 > 0; rescaling V 
we may assume EV 2 = 1. Define 

X 7 := Re(e i7 Z) = a 7 V, 
Z 7 := 2e i7 X 7 = e 2i7 Z + Z. 

We have 

EZ 2 = 4e 2i7 EX 2 = 4e 2i7 a 7 , 
EZ 7 3 = 8e 3i7 E X 3 = 8e 3i7 a 3 E y 3 , 

and thus 

(EZ*) 2 = (EV 3 ) 2 (EZ 2 ) 3 . (6.1) 
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On the other hand, 

EZ 2 = E(e 2i7 Z + Z) 2 = e 4i7 EZ 2 + 2e 2i7 E|Z| 2 + EZ 2 , (6.2) 

E Z 3 = e 6i7 E Z 3 + 3e 4i7 E(Z 2 Z) + 3e 2i7 E(ZZ 2 ) + E Z 3 . (6.3) 

Hence, E Z 2 and E Z 3 are polynomials p 2 ( e217 ) an d P3 (e 217 ) in e 217 of degrees 
at most 2 and 3. By 1)6.1(1 . 

p 3 ^) 2 = (EF 3 ) 2 p 2 (z) 3 (6.4) 

for every z with |^| = 1, and thus for every complex z. 

If E V 3 = 0, then EZ 3 = by (jEEJ) and thus (ii) holds by l|Ol . 

Suppose now EF 3 / 0. If E Z 2 = 0, then pi{z) has degree 1 or by 1)6.2)1 . 
Degree 1 is impossible by (|fi.4|) . and thus E |Z| 2 = by 1)6.2)1 so Z = a.s., 
and both (i) and (ii) hold. 

Finally, if E V 3 / and E Z 2 / 0, then (|6.4j) implies that p 2 has a double 
root, so its discriminant (E |Z| 2 ) 2 - E Z 2 E Z 2 = 0, and |EZ 2 | = E|Z| 2 . It 
follows that the argument of Z 2 is constant a.s., and thus (i) holds. □ 

Returning to Theorem I2.1f iii). we can use the moments computed in 
Remark 12.31 The proposition shows that there really are oscillations, even 
with different normalizations, as soon as | E(Z m — EZ m ) 2 | < E |Z m — EZ m | 2 
and E(Z m - E Z m f ^ 0. 

It should be possible to verify this for all m > 7, perhaps using asymptot- 
ical expansions for large m, but for simplicity we have resorted to numerical 
verification (by Maple) for m < 100. We have also done the same for BST 
and Theorem I2.7f ii). using the moments given in Remark 12.91 We thus 
conclude the following result, showing that at least for these m, there are 
genuine oscillations. 

Theorem 6.2. For RRT, at least for 7 < m < 100, there are oscillations 
in Theorem \2.lV iii): a^X^ — b n ) does not have a non- degenerate limit 
distribution for any sequence of norming constants a n > and b n . The 
same holds for BST, at least for 9 < m < 100. □ 

Remark 6.3. The fact that E(Z m - E Z m ) 3 / also implies by that 
the subsequence limit := Re(e 17 Z) has a non-zero third central moment, 
except for at most 6 values of 7 S [0, 2tt); in particular, X^ is not normal 
except possibly for a few exceptional 7. Presumably, these too could be 
eliminated by considering fourth or fifth moments, but we have not pursued 
that. 
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